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The complete nucleotide sequence of the ns2 gene of 
human coronavirus OC43 (HCY-OC43) was deter¬ 
mined. Sequence analysis revealed an open reading 
frame that could encode a protein of 278 amino acids, 
with an estimated molecular mass of 32-2 kDa. Six 
potential phosphorylation sites are present but no sites 
of A-glycosylation were found. The amino acid sequence 
of the HCV-OC43 ns2 protein shows 92 % identity with 


that of the Mebus strain of bovine coronavirus (BCV). 
However, a stretch of nine consecutive amino acids near 
the C terminus is completely different, causing it to be 
very hydrophilic, which contrasts with the hydrophobic 
nature of this region in BCV. As shown by immuno¬ 
fluorescence with a monospecific antiserum, the ns2 
protein was expressed in the cytoplasm of HCV-OC43- 
infected HRT-18 cells. 


Coronaviruses are enveloped viruses that contain a 
single-stranded positive sense RNA genome of 27 to 
31 kb (Boursnell et al., 1987; Lee et al., 1991). The 
genomic RNA encodes six to eight capped and poly- 
adenylated subgenomic mRNAs that are arranged in a 
3'-coterminal nested set structure. Each mRNA possesses 
a common 5' end leader sequence derived from the 5' end 
of the genomic RNA. It has been suggested that the 
interaction of the 3' end of the leader sequence with the 
full-length minus strand genomic RNA, at the consensus 
intergenic sequences, initiates the synthesis of sub¬ 
genomic mRNAs of diverse lengths (Lai, 1990). 

Human coronaviruses (HCV) have so far been repre¬ 
sented by two prototype strains, OC43 and 229E, which 
belong to two distinct antigenic groups. HCV-OC43 
shares antigenicity with bovine coronavirus (BCV), 
haemagglutinating encephalomyelitis virus of swine 
(HEV) and murine hepatitis virus (MHV). HCV-229E is 
antigenically related to porcine transmissible gastroen¬ 
teritis virus (TGEV) and canine coronavirus (CCV). 
Human coronaviruses are responsible for 15 to 35% of 
common colds (McIntosh, 1974; Myint, 1994; Wege et 
al ., 1982) and have been associated with severe diarrhoea 
in the newborn (Resta et al ., 1985). Their involvement in 
neurologic diseases such as multiple sclerosis has also 
been suggested (Murray et al., 1992; Stewart et al., 1992). 


* Author for correspondence. Fax + 1 514 686 5531. 

The nucleotide sequence data reported in this paper have been 
deposited in the EMBL and GenBank nucleotide sequence databases 
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Human coronavirus OC43 comprises four or five 
major structural proteins: a peplomer (S) glycoprotein, 
an haemagglutinin-esterase (HE) glycoprotein, a nucleo- 
capsid (N) phosphoprotein and a membrane (M) 
glycoprotein (Mounir & Talbot, 1992, 1993 a; Zhang et 
al., 1992; Kamahora et al., 1989), as well as a predicted 
small membrane protein (sM) (Mounir & Talbot, 19936), 
apparently similar to the one previously identified in 
virions of infectious bronchitis virus (IBV) and TGEV 
(Liu & Inglis, 1991, Godet et al., 1992). In addition to 
these structural proteins, HCV-OC43 possesses several 
open reading frames (ORF) that could encode putative 
nonstructural (ns) proteins (Mounir & Talbot, 19936). 

The OC43 strain of the human respiratory coronavirus 
and the bovine enteric coronavirus are antigenically very 
similar since no polyclonal serum can distinguish 
between them (Hogue et al., 1984). Indeed, the predicted 
amino acid sequences of all described structural and 
nonstructural proteins of these two viruses show over 
91 % identity (Mounir et al., 1994). The major genomic 
difference observed so far between HCV-OC43 and BCV 
is the absence on HCV-OC43 of two ORFs that could 
encode putative nonstructural proteins of 4-9 and 
4-8 kDa in BCV (Mounir & Talbot, 19936; Abraham et 
al., 1990). Further sequence analysis of the BCV genome 
has revealed an additional ORF located upstream of 
mRNA 2 (Cox et al., 1989). This gene was demonstrated 
to be expressed in BCV-infected cells and to encode a 
nonstructural phosphoprotein (ns2) of 32 kDa (Cox et 
al., 1991). A similar protein has been found in MHV- 
infected cells (Luytjes et al., 1988; Bredenbeek et al., 
1990). In the present study, we have cloned, sequenced 
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Fig. 1. Complete nucleotide sequence of the ns2 protein gene of HCV-OC43 and its deduced amino acid sequence. The leader sequence 
is underlined. The intergenic consensus sequence is doubly underlined. Potential phosphorylation sites (•) are indicated. An asterisk 
marks the termination codon. 


and expressed in Escherichia coli the corresponding 
region of HCV-OC43 and identified a gene that could 
encode a protein of 32-2 kDa. With monospecific 
antisera, we have also detected the expression of this 
protein in infected cells. 

The origin and cultivation of HRT-18 cells and the 
OC43 strain of HCV has been described previously 
(Mounir & Talbot, 1992). The nucleotide sequence 
corresponding to the HCV-OC43 ns2 gene was obtained 
by RT-PCR on viral RNAs. The antisense primer 5' 
TTAGTCTTCTTCAACTGGAG 3', complementary to 
nucleotides 895 to 914 (numbering is from Fig. 1) of the 
BCV ns2 gene (Cox et al ., 1989), was used for cDNA 
synthesis. The sense primer 5' TTGTGAGCGAAGTT- 
GCGT 3', homologous to a portion of the HCV-OC43 
leader sequence (Kamahora et al., 1989; Fig. 1, nucleo¬ 
tides 1 to 18) and the antisense primer were used for 
cDNA amplification. PCR products were cloned into the 
pCRII TA cloning vector (Invitrogen). Nucleotide 
sequencing was performed on both strands of at least 
two clones of two PCR products by the dideoxy- 
nucleotide chain termination method (Sanger et al., 
1977) using T7 DNA polymerase (Pharmacia) and [a- 
35 S]dATP (Amersham). Two internal primers were used 


to complete the nucleotide sequence: 5' GCTATTCA- 
AGAAGTT 3', homologous to nucleotides 276 to 290, 
and 5' CGCCTTCTTCTAGCTCT 3', complementary 
to nucleotides 728 to 744 (numbering is from Fig. 1). 
Sequence analyses, including hydropathy plots (Kyte & 
Doolittle, 1982), were performed with the MacVector 3.5 
(International Biotechnologies, Intersciences Inc.) and 
GeneWorks 2.0 (IntelliGenetics Inc.) sequence analysis 
programs. 

As shown in Fig. 1. the region upstream of the HE 
gene of HCV-OC43 contains an ORF of 278 amino acids 
that could encode a protein with an estimated molecular 
mass of 32-2 kDa. The consensus intergenic sequence 
UCUAAAC observed upstream of many other corona- 
virus genes (Cox et al., 1989; Shieh et al., 1987) was 
found 15 nucleotides upstream of the initiation codon. 
Six potential phosphorylation sites (Fig, 1) but no 
potential /V-glycosylation sites were found in this 
predicted protein. Previous studies on the BCV ns2 
protein revealed the presence of the phosphorylated 
amino acid residues serine and threonine (Cox et al., 
1991). Interestingly, four of the six potential phos¬ 
phorylation sites are shared by BCV and HCV-OC43, 
which is consistent with the possibility that the HCV- 
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HCV-OC43 MA--VAYADKPNHFINFPLTHFQGFVLNYKGLQFQILDEGVDCKIQTAPHISLTMLDIQPEDYKSVDVAI 68 

BCV .Q.E.L.A.R. 68 

MHV-JHM . .ARM. F.AQ . S . .MGK.LK. .S.LVEM.L. . .L.KV. .V.ITL. . . KADQ . .Q.EF. . 70 

MHV-A59 -M.F.AQ . S . .MGK.LK. .S.LVEM.L. . . L . K. . .V.ITL. . .KADQ. .Q.EF. . 66 

HCV-OC43 QEVIDDMHWGDGFQIKFENPHILGRCIVLDVKGVEELHDDLVNYIRDKGCVADQSRKWIGHCTIAQLTDA 138 

BCV .E.D. 138 

MHV-JHM . .1. . .LAAYE.-D.V.D. . .M. . . .L.F. . . .E.I.EIL.RR. .T.Q. .P. . .V. .FDEE 139 

MHV-A59 ..1...LAAYE.-D.V.D...M....L....R.F....E.I.EIL.RR.,T.H..P...V..FDEG 135 


HCV-OC43 ALSIKENVDFINSMQFNYKITINPSSPARLEIVXLGAEKKDGFYETIVSHWMGIRFEYTSPTDKLAMIMG 208 

BCV .S. . . - NP. 208 

MHV-JHM K-E...-MQ.YFKLP.-.LKHN.LLTD....L..I.SS.V....CSEL.I.C.E.LC.KP..P.FSD.F. 206 
MHV-A59 K-RN.R-NE.YHKEP.-.LKHN.LLTD.G..L..I.SS.I....CSEL.V.C.E.LC.KP..P.FSD.F. 202 


HCV-OC43 YCCLDWRKELEEGDLPENDDDAWFKLSYHYENNSWFFRHVYRKSFHFRKACQNLDCNCLGFYESPVEED* 278 

BCV ...SE .VLISVSLV-.* 277 

MHV-JHM . . . IDKI.GD. .1-- - -PD.EE. .AE.QR.TY.HDN. IV. . TV. RMKG. M. * 265 

MHV-A59 ... IDKI.GD. .I. .. .QD.EE. .AE.QR.TY.HDN. IV. .TV.RMKG.M. * 261 


Fig. 2. Amino acid comparison between the ns2 proteins of human, bovine (Cox et al ., 1989) and murine (Luytjes et at., 1988; 
Bredenbeek et at ., 1990) coronaviruses by alignment for maximum identity. Dots indicate residues identical to those of HCV-OC43. 
Hyphens represent gaps introduced into the sequence to maximize alignment. 


OC43 ns2 protein is phosphorylated like its BCV 
counterpart. 

As shown in Fig. 2, the putative HCV-OC43 ns2 
protein shares 92 % amino acid sequence identity with its 
BCV counterpart. Interestingly, a stretch of nine con¬ 
secutive amino acids near the C terminus of the predicted 
protein is completely different between the two viruses. 
The presence of one additional nucleotide on HCV- 
OC43 (at position 834, Fig. 1) is responsible for this 
modification. Twenty-four residues downstream (at 
position 858, Fig. 1), the original reading frame of the 
ns2 gene is restored by insertion of two nucleotides. 
Hydropathy plots for these two ns2 proteins revealed 
that this stretch of nine amino acids is very hydrophilic 
in HCV-OC43 but highly hydrophobic in BCV (data not 
shown). Thus, such a drastic change in this region of the 
protein could affect the secondary and tertiary structure 
of the ns2 protein and consequently modify its biological 
function. The identity levels between the ns2 proteins of 
HCV-OC43 versus those of MHV-A59 and MHV-JHM 
are 49% and 51%, respectively (Fig. 2). 

The ns2 proteins of MHV-A59, MHV-JHM and BCV 
possess three nucleotide binding domains (Cox et al., 
1991; Luytjes et al., 1988). Even though the HCV-OC43 
ns2 protein possesses a similar nucleotide binding 
domain, it is unlikely that this protein could interact with 
RNA since its isoelectric point is 4-8. Indeed, the negative 
charge of the ns2 protein and RNA would result in 
mutual repulsion of these molecules at physiological pH. 

To analyse the expression of the HCV-OC43 ns2 
protein in infected cells, immunofluorescence assays were 
performed on HRT-18 cells infected with HCV-OC43 at 
an m.o.i. of 0 05. At 66 h post infection, the cells were 
fixed with acetone and incubated first with a mouse 


antiserum specific for a recombinant form of the BCV 
ns2 protein (a kind gift of Dr Pascal Boireau, Centre 
national d’etudes veterinaires et alimentaires, Maisons- 
Alfort, France), then with fluorescein-conjugated F(ab') 2 
fragments of goat anti-mouse antibody (Cappel Research 
Products, Organon Technika Inc.) and observed under 
UV light. As shown in Fig. 3, specific fluorescence was 
distributed uniformly around the nucleus, which is 
consistent with the expression of the ns2 protein in the 
cytoplasm (Fig. 3 b). As expected, uninfected cells showed 
no fluorescence (Fig. 3 a). Negative and positive controls 
were performed on infected cells with a preimmune 
serum (Fig. 3 c) and with a murine monoclonal antibody 
directed against the HEV nucleocapsid protein (a kind 
gift of Dr Serge Dea, Institut Armand-Frappier, Laval, 
Quebec, Canada) (Fig. 3 c?). Similar results were obtained 
with a monospecific rabbit antiserum produced against 
the HCV-OC43 ns2 protein. To produce this antiserum, 
the HCV-OC43 ns2 gene was reamplified by PCR with 
sense and antisense primers to which BamHl sites were 
added. The PCR product was purified, digested with 
BamHl and ligated into the UamHI-digested pMAL-c2 
vector (New England Biolabs). The resulting plasmid 
was introduced into E. coli. Clones containing the insert 
were sequenced to verify the integrity of the ns2 gene. 
The production of the maltose-binding protein (MBP) 
fused to the N terminus of the ns2 protein was induced 
in selected clones with 0-3 mM-isopropyl /Lthiogalactos- 
ide. The MBP-ns2 fusion protein was purified by affinity 
chromatography on an amylose resin (New England 
Biolabs) for production of a monospecific rabbit anti¬ 
serum. Approximately 300 jig of the purified fusion 
protein was injected subcutaneously into a 3 kg New 
Zealand White female rabbit. The antigen was first 
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Fig. 3. Indirect immunofluorescence staining of HCV-OC43-infected cells. At 66 h after infection, cells were fixed and stained with 
primary mouse antibodies, followed by FITC-conjugated goat anti-mouse secondary antibody, (a) Uninfected cells stained with an ns2- 
specific mouse antiserum; (b) infected cells stained with an ns2-specific mouse antiserum; (c) infected cells stained with a preimmune 
mouse antiserum; ( d ) infected cells stained with an N-specific murine monoclonal antibody. 


administered with complete Freund’s adjuvant and 
subsequent injections (every 2 weeks) used incomplete 
Freund’s adjuvant. The antiserum was collected when 
optimal levels of specific antibodies to the MBP-ns2 
fusion protein were reached; this required seven in¬ 
jections, an indication of the low immunogenicity of this 
protein, at least in rabbits. 

In conclusion, we have shown that FICV-OC43 does 
express a protein of 32 kDa encoded by mRNA 2 (ns2), 
as was previously observed with BCV (Cox et al., 1991) 
and MHV (Bredenbeek et al., 1990; Zoltick et al., 1990). 
It is noteworthy that the ns2 gene has so far only been 
observed in coronaviruses of the same antigenic group, 
in which an HE gene is also present, except for MHV- 
A59, which does nevertheless contain a pseudogene 
(Luytjes et al., 1988; Shieh et al., 1989). Therefore, 
possible physical and/or functional interactions between 
the ns2 and HE proteins or genes need to be investigated. 

This completes the characterization of the region 
downstream of the polymerase gene of HCV-OC43. The 
polymerase gene has been shown in other coronaviruses 
to encode several putative nonstructural polypeptides 
that have yet to be fully characterized (Boursnell et al., 
1987; Denison et al., 1991; Herold et al., 1993). The 
genomic organization of this human coronavirus is thus 


as follows: 5' pol/ns2/HE/S/ns4/sM/ns5-l/M/N 3'. 
The high amino acid identities (91 to 97%) observed 
between HCV-OC43 and BCV structural and non¬ 
structural proteins suggests that the two viruses have 
diverged only recently. However, they have developed 
different pathogenic properties, including an apparent 
lack of replication of HCV-OC43 in a bovine host (P. J. 
Talbot & L. A. Babiuk, unpublished observations). The 
previously reported genomic deletion in HCV-OC43 
compared to BCV (Mounir & Talbot, 19936; Abraham 
et al., 1990), remains the only major structural difference 
between the two viruses, although its relevance to the 
biology of these structurally related viruses, one a 
recognized respiratory pathogen (HCV-OC43) and the 
other a recognized enteric pathogen (BCV), remains to 
be established. Genetic engineering of the HCV-OC43 or 
BCV genome will most likely be necessary for a definitive 
characterization of the molecular basis of virus tropism. 
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